Optimal Power Allocation for Distributed Detection 
over MIMO Channels in Wireless Sensor Networks 

Xin Zhang, H. Vincent Poor, and Mung Chiang 



o 
o 

(N 



O 



(N 
> 

O 
m 
O 

o 

o 



X 
S3 



Abstract — In distributed detection systems with wireless sensor 
networks, the communication between sensors and a fusion center 
is not perfect due to interference and limited transmitter power 
at the sensors to combat noise at the fusion center's receiver. The 
problem of optimizing detection performance with such imperfect 
communication brings a new challenge to distributed detection. 
In this paper, sensors are assumed to have independent but 
nonidentically distributed observations, and a multi-input/multi- 
output (MIMO) channel model is included to account for imper- 
fect communication between the sensors and the fusion center. 
The J-divergence between the distributions of the detection 
statistic under different hypotheses is used as a performance 
criterion in order to provide a tractable analysis. Optimizing 
the performance (in terms of the J-divergence) with individual 
and total transmitter power constraints on the sensors is studied, 
and the corresponding power allocation scheme is provided. It is 
interesting to see that the proposed power allocation is a tradeoff 
between two factors, the communication channel quality and the 
local decision quality. For the case with orthogonal channels 
under certain conditions, the power allocation can be solved 
by a weighted water-filling algorithm. Simulations show that, 
to achieve the same performance, the proposed power allocation 
in certain cases only consumes as little as 25 percent of the total 
power used by an equal power allocation scheme. 

Index Terms — Distributed detection, wireless sensor networks, 
power allocation, MIMO channel 



I. Introduction 

Wireless sensor networks (WSN) have received considerable 
attention recently. Event monitoring is a typical application of 
wireless sensor networks. In event monitoring, a number of 
sensors are deployed over a region where some phenomenon 
is to be monitored. Each sensor collects and possibly processes 
data about the phenomenon and transmits its observation or a 
summary of its observation to a fusion center (FC). The FC 
makes a global decision about the state of the phenomenon 
based on the received data from the sensors, and possibly 
triggers an appropriate action. 

The essential part of event monitoring is a detection prob- 
lem, i.e., the FC needs to detect the state of the phenomenon 
under observation. In wireless sensor networks, due to power 
and communication constraints, sensors are often required 
to process their observations and transmit only summaries 
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of their own findings to an FC. In this case, the detection 
problem associated with event monitoring becomes distributed 
detection (also called decentralized detection). 

Distributed detection is obviously suboptimal relative to 
its centralized counterpart. However, energy, communication 
bandwidth, and reliability may favor the use of distributed 
detection systems. Distributed detection has been studied for 
several decades. Particularly, the design of optimal and sub- 
optimal local decision and fusion rules has been extensively 
investigated. Tsitsiklis [31], Varshney [32], Viswanathan and 
Varshney [33], and Blum et al. [3] provide excellent reviews 
of the early work as well as extensive references. 

However, most of these studies assume that a finite valued 
summary of a sensor is perfectly transmitted to an FC, i.e., no 
error occurs during the transmission. In distributed detection 
systems based on wireless sensor networks, this assumption 
may fail due to interference and limited transmitter power at 
sensors to combat receiver noise at the FC. The problem of 
optimizing detection performance with imperfect communica- 
tions between the sensors and the FC over wireless channels 
brings a new challenge to distributed detection. 

Rago et al. [28] consider a "censoring" or "send/no-send" 
idea. The sensors may choose to transmit data or keep silent 
according to a total communication rate constraint and values 
of their local likelihood ratios. Predd, Kulkarni and Poor [27] 
examine a related protocol for the problem of distributed 
learning. Duman and Salehi [13] introduce a multiple access 
channel model to account for noise and interference in data 
transmission, and optimal quantization points (in the person- 
by-person sense) were obtained on the original observations 
through a numerical procedure. Chen and Willett [6] assume 
a general orthogonal channel model from the local sensors 
to the FC and investigate the optimality of the likelihood 
ratio test (LRT) for local sensor decisions. Chen et al. [5] 
formulate the parallel fusion problem with a fading channel 
with instantaneous channel state information (CSI) and derive 
the optimal likelihood ratio (LR)-based fusion rule with binary 
local decisions. Niu et al. [24] extend the results of [5] to the 
case without instantaneous CSI. Note that both [5] and [24] 
assume orthogonal channels between the sensors and the FC. 

Chamberland and Veeravalli [7] [8] [9] provide asymptotic 
results for distributed detection in power (or equivalently, 
capacity) constrained wireless sensor networks. More specif- 
ically, [7] shows that, when the sensors have i.i.d. Gaussian 
or exponential observations and the sensors and the FC are 
connected with a multiple access channel with capacity R, 
having identical binary local decision rules at the sensors 
is optimal in the asymptotic regime where the observation 
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interval goes to infinity. [8] considers a similar problem but 
with a total power constraint instead of a channel capacity 
constraint, and shows that using identical local decision rules 
at the sensors is optimal for i.i.d. observations. [9] considers 
the detection of 1 -dimensional spatial Gaussian stochastic 
processes. An amplify-and-relay communication strategy with 
power constraint is used and the channels are orthogonal with 
equal received signal power from each sensor. They assume 
sensors are scattered along 1 -dimensional space and have 
correlated observations of the Gaussian stochastic processes. 
The tradeoff between sensor density and the quality of infor- 
mation provided by each sensor is studied using an asymptotic 
analysis. 

Liu and Sayeed [20] and Mergen et al. [21] propose 
the use of type based multiple access (TBMA) to transmit 
local information from the sensors to the FC, and present a 
performance analysis of detection at the FC. The results of 
[20] and [21] focus mainly on the case with i.i.d. observations 
at the sensors. Jayaweera [15] studies the fusion performance 
of distributed stochastic Gaussian signal detection with i.i.d. 
sensor observations, assuming an amplify-and-relay scheme. 

Chamberland and Veeravalli [10] provide a survey of much 
of the recent progress in distributed detection in wireless 
sensor networks with resource constraints. 

In this paper, we propose a distributed detection system 
infrastructure with a virtual multi-input multi-output (MIMO) 
channel to account for non-ideal communications between 
a finite number of sensors and an FC. Our analysis does 
not consider an infinite number of sensors because in many 
practical cases, only a few tens of sensors are used. We assume 
the sensors have independent but nonidentically distributed 
observations, so they have different local decision qualities. 
Each sensor has an individual transmitting power constraint, 
and there is also a joint power constraint on the total amount 
of power that the sensors can expend to transmit their local 
decisions to the FC. The goal is to optimally distribute the 
joint power budget among the sensors so that the detection 
performance at the FC is optimized. 

The J-divergence between the distributions of the detection 
statistic under different hypotheses is used as a performance 
index instead of the probability of error in order to provide 
a more tractable analysis. A power allocation scheme is 
developed with respect to the J-divergence criterion, and in- 
depth analysis of the special case of orthogonal channels is 
provided. The proposed power allocation is shown to be a 
tradeoff between two factors, the quality of the communication 
channel and the quality of the local decisions of the sensors. 
As will be shown in the simulations, to achieve the same 
performance in certain cases, the power allocation developed 
in this paper consumes as little as 25 percent of the total power 
used by an equal power allocation scheme. 

This paper differentiates from previous work in the follow- 
ing aspects. 

• A system with the sensors and the FC connected by a 
virtual MIMO channel is considered. 

> The sensors have independent but nonidentically dis- 
tributed observations, and hence they have different local 
decision qualities. 



> We develop the power allocation scheme for a finite 
number of sensors rather than asymptotically. 

• To improve global detection performance within a power 
budget, we focus on how to efficiently and effectively 
transmit the local sensor decisions to the FC rather than 
how to design local and global decision rules. 

• The proposed power allocation includes tradeoffs be- 
tween communication channel quality and local decision 
quality. 

The rest of the paper is organized as follows. In Section HI1 
we introduce a distributed detection system infrastructure with 
a MIMO channel model. In SectionHnl we develop the optimal 
power allocation scheme with respect to the J-divergence 
performance index. In Section IIVI we study a special case 
in which the sensors transmit data to the FC over orthogonal 
channels. In Section [V] we provide numerical examples to 
illustrate the proposed power allocation. We conclude the 
paper in Section |VI] 

II. Models 

Let us consider a hypothesis testing problem with two 
hypotheses Hq and Hi, as shown in Figure [T] There are 
K wireless sensors with observations x = [xi,-" ,xk] T - 
The observations are independent of each other but are not 
necessarily identically distributed. The conditional probability 
density functions of these observations (conditioned on the 
underlying hypotheses) are given by p(x\Hi) for i 6 {0, 1}. 
The sensors then make local decisions u = [ui,--- ,uk] t 
according to their local decision rules: 



Uk = lk{xk) 



decide Ho 

1 decide Hi 



(1) 



where k = 1, • • • ,K. In this paper, we assume the local 
sensors do not communicate with each other, i.e., sensor 
k makes a decision independently based only on its own 
observation Xk- The local decision rules Yfc(-) do not have 
to be identical, and the false alarm probability and detection 
probability of sensor k are given by 



P F (k)= P (u k = 1\H ), 



(2) 



and 



P D (k)=p(u k = l\Hi). (3) 

We assume the sensors have knowledge of their observation 
quality in terms of Pd and Pp, which can be obtained by 
various standard methods from detection theory [25]. The joint 
conditional density functions of the local decisions are 

K 

p(u\H ) = l[P F (k)^(l-P F (k))^\ (4) 

and 



k=l 
K 



p(u\Hi) = Y[p D (kr"(i-p D (k))^i (5) 



k=l 



The local decisions are transmitted to an FC through a MIMO 
channel, modelled by the following sampled baseband signal 
model (see, e.g., [35]): 



HAu + n, 



(6) 
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Fig. 1. Distributed detection system diagram. 

where y = [yi,--- ,Un] T contains the received signals at 
the FC. A = diagjai, 02, • • • , an}, is a diagonal matrix, the 
diagonal elements of which are the amplitudes of the signals 
transmitted from the sensors. H is the channel matrix, which 
is assumed to be deterministic in this paper 1 , n is an additive 
noise vector which is assumed to be Gaussian with zero mean 
and covariance matrix R. We assume that the channel quality 
(in terms of the H and R matrices) is known at the FC. This 
information can be obtained by channel estimation techniques. 
The dimension of y, determined by the receiver design, is 
N, which does not have to be the same as the number of 
sensors K. Many different wireless channels and multiple 
access schemes can be expressed with the MIMO model in 
©, including CDMA, TDMA, FDMA, as well as TBMA 
[20][21][34][30]. 

The conditional density function of the received signals y 
at the FC given the transmitted signals u from the sensors is 



p(y|u) = 



1 



|2vrR|2 



-exp 



' 5 (y — H Au) T R~ 1 (y - HAu) 



(7) 



The conditional density functions of the received signals given 
the two hypotheses are 



p(y| J ff i )=^p(y|u)p(u|ff l ), 



(8) 



where the summation is over all possible values of u. The FC 
applies its fusion rule Yo(-) to y to get a global decision 



mo = 7o(y)- 



(9) 



The system is summarized in Figure Q] We notice the Markov 
property of the system: {Hi} — > x — > u — > y — > uq forms a 
Markov chain, which is used to derive (HJ and will be used in 
the next section. 

In this paper, we do not focus on the design of local 
and global decision rules to optimize the detection perfor- 
mance at the FC. Instead, we focus on how to intelligently 
distribute a total transmitter power budget Ptot among the 
sensors, by choosing an amplitude matrix A within the 



constraint Tr 



AA 



< Pt, 



There are also individual 



'We focus on the case in which the sensors and the FC have minimal 
movement and the environment changes slowly. In this case, the coherence 
time [30] of the wireless channel can be much longer than the time interval 
between two consecutive decisions made by the FC, and instantaneous CSI 
can be obtained. 



power constraint for each sensor, A X ^/Pmax, to ac- 
count for the maximum output power at each sensor. Here, 
v/Pmax denotes the component-wise square root of P max = 
diag{P max (l), • • • ,P max (K)}, where P max (k) is the trans- 
mitting power limit of sensor k. The matrix inequality < 
means \/P m ax — A is positive semidefinite. 

III. Optimal Power Allocation 

In this section, an optimal power allocation among the 
sensors in the distributed detection system described in Section 
HI1 is studied. We first choose a detection performance metric 
for our analysis. 

There are three categories of commonly used detection 
performance metrics [25]: exact closed-form expressions of the 
miss probability Pm (which equals 1 — Pr>) and false alarm 
probability Pf (or the average error probability P e , if prior 
probabilities of the hypotheses are known), distance related 
bounds, and asymptotic relative efficiency (ARE). 

The natural performance metric, closed-form expressions of 
Pm and Pp (or P e ), is hard to obtain even for centralized 
detection. ARE is useful for detection systems under large- 
sample-size (long observation duration) and weak signal con- 
ditions. Distance related bounds are upper or lower bounds 
on Pm and Pp (or P e ), such as the Chernoff bound, the 
Bhattacharyya bound, and the J-divergence [25]. 

In this paper, we use distance related bounds, more specif- 
ically the J-divergence, as the performance metric. The J- 
divergence, first proposed by Jeffreys [16], is a widely used 
metric for detection performance [17] [18] [19] [26]. It provides 
a lower bound on the detection error probability P e [18] via 
the inequality 



P e >p(H ) P (H 1 )e 



-J/2 



(10) 



We choose the J-divergence as the performance metric 
because it provides more tractable results in our study, it is 
closely related to results in information theory, such as the 
data processing lemma [12], and it is also closely related 
to other types of performance metrics. [19] shows that the 
ratio of the J-divergences of two test statistics is equivalent 
to the ARE under some circumstances. The J-divergence 
and the Bhattacharyya bound both belong to a more general 
class of distance measures, the Ali-Silvey class of distance 
measures [ 1 ] . The J-divergence is the symmetric version of the 
Kullback-Leibler (KL) distance [11] [12], and the KL distance 
is asymptotically the error exponent of the Chernoff bound 
from Stein's lemma [11]. 

The J-divergence between two densities, p\ and po, is 
defined as 



J(pi,Po) = D(pi\\po) +D(po\\pi), 



(11) 



where D(pi\\po) is the (non-symmetric) KL distance between 
pi and po. D(pi\\po) and D(po\\pi) are defined as 



D(Pi\\Pj) = 



1 I Pi 
log — 

Pi 



Pi- 



(12) 



There is a well-known data processing lemma on the KL 
distance along a Markov chain [12, LEMMA 3.11]: 
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Lemma 1 The KL distance is non-increasing along the 
Markov chain x — > u — > y, i.e., 



and 



D(p(x|JTi)||p(x|flb)) > r>(p(u|ffi)||p(u|/r )), (13) 



one/ 

D(p(u|fTi)||p(u|ff )) > D{p(y\H x )\\pty\H )). (14) 

This result can be easily generalized to the J-divergence with 
the following corollary. 

Corollary 1 The J-divergence is non-increasing along the 
Markov chain x • u ► y, i.e., 



J(p(x|ffi),p(x|ff )) > ■7(p(u|fT 1 ) J p(u|flb)), (15) 



and 



j(p(u|fri),p(u|H )) > ./b(y|#i),p(y|#o)). (16) 

Corollary [TJ tells us that a performance upper bound of the 
detection at the FC is provided by J(p(u|i3i),|>(u|-Ho))- This 
can be achieved only when there are perfect data transmissions 
from the sensors to the FC, i.e., the FC receives u with no 
error. 

Recall that our goal is to optimize the detection performance 
at the FC. This now translates into maximization of the J- 
divergence between the two densities of the received signals y, 
with respect to the underlying hypotheses. The optimal power 
allocation is thus the solution to the following optimization 
problem: 



max J(p(y\H 1 ),p(y\H )), 

A 



(17) 



s.t. Tr 



AA 



<Pt, 



X A X VPmax, 

where the J-divergence J(p(y\Hi),p(y\Ho)) is given by 

J(p(y|ffi),p(y|ffo)) 

= /dyb(y|^i)-My|^o)]iog^4l4- < 18 > 
J y p{y\H Q ) 

The density functions p(y\Hi), i 6 {0, 1}, are given by ©- 

It can be seen that the conditional density functions p(y\Hi) 
are Gaussian mixtures. Unfortunately, the J-divergence be- 
tween two Gaussian mixture densities does not have a general 
closed-form expression [22] [29]. In order to present the ob- 
jective function in ( fT7Tb in closed-form, approximations must 
be made. An upper bound has been suggested in [29] based 
on the log-sum inequality [11]. However, this upper bound is 
not suitable for the study here, since the dependence on the 
power of transmitted signals is lost in the bound. 

In this paper, the J-divergence of two Gaussian mixture 
densities p(y\Hi) is approximated by the J-divergence of two 
Gaussian densities p g (y\Hi) = Af(y; Mi) £,;). The parameters 
of the Gaussian densities are provided by moment matching, 
i.e., 



Mi = / yp(y\Hi)dy, 

>y 



(19) 



£< = / [y - Mi][y - »i] T p(y\Hi)dy, 

J v 



(20) 



for i G {0, 1}. That is, a Gaussian mixture density is approxi- 
mated by a Gaussian density with the same mean and variance 
as the Gaussian mixture density. 

Obviously the quality of this approximation will directly 
affect the analysis in this paper and the difference between the 
optimal scheme and the proposed scheme, which is optimal for 
the approximated cases. It can be seen from ©-(O that when 
Tr[(HA) T R _1 (HA)] — > 0, the Gaussian mixture density in 
(HJl approaches a Gaussian distribution. So, we can predict 
the approximation will work well for the low SNR cases, and 
simulations in Section [V] show that it still works well even 
with received SNR as high as 10-12 dB. 

We next calculate the means and covariance matrices of the 
Gaussian densities p g (y\Hi), i G {0, 1}. From ®, ( fT9b . and 
the Markov property of the system, 

Mi = / yp{y\Hi)dy, 
Jy 



u 

= £)p(u|JTi) [yp(y\u)dy. (21) 
u J y 

Recall that p(y|u) is a Gaussian density with mean HAu, as 
shown in (0, so 



Mi = ^2HAup{u\Hi 



(22) 



By applying (0]i and (0, we have 

IM = HA/3,, (23) 

for i G {0, 1}, 

0i=J2 M"\Hi) = [Pd(1), ■ • ■ , Pd{K)] t , (24) 

U 

and 

A) = Mu\Ho) = [Pf(1), • • • , Pf{K)] t . (25) 

u 

Similarly, from (|20| > and the Markov property of the system, 
we have 

£ 4 = / [y - Mi][y - m} T p(y\Hi)dy, 

Jy 

= / [y - Hi\[y - m} T ^2p{y\u)p(\i\Hi)dy, 
Jy u 

= 5^p(u|JTi) Ay - HAu + HAu - Mi ] 
u y 

■ [y - HAu + HAu - ^] T p(y|u)dy, 

= R + ^p(u|^)[HAu - [HAu - /i. t ] T . (26) 

u 

The last step follows because p(y|u) is a Gaussian density 
with mean HAu and covariance matrix R. Applying (0]i and 
0, and after some algebra, we obtain 

= R + HAB 4 A T H T , (27) 



where 

B 1 = diag{P D (l)[l-P D (l)],--- ,P D (K)[1-P D ( 
and 

B = diag{P F (l)[l - P F (1)], • • • , P F {K)[l - P F (. 

We next derive the J-divergence between the G 
densities, J(p g (y\Hi),p g (y\Ho)). From the definitioi 
J-divergence and the KL distance in (fTTTi-(fT2]i. we ha 

J(p 9 (y|ff 1 ),p s (y|^o)) 

dy[p 9 (y|^i)-^(y|^o)]iog^#T- 

y Pg{y\ H a) 

Using the fact that p g (y\Hi) are Gaussian d 
Af(y; Hi, i £ {0, 1}, after some algebra, we obtai 

J(p g (y\Hi),p g (y\H )) 



1 

2 Tr 



+ ( s i 1 + E o *) (Mi - Mo) (Mi ~ Mo) J 



N. 



where N is the dimension of the received signal vector y at 
the FC. Using the means fa and covariance matrices Sj in 
d23l l and (f27]i, and after some algebra, we have, 



1 

2* 



J(j> g {y\H x lp a {y\H )) 

R + HA(B + (3{3 T )A T H T 
[R + HABiA T H T ] ~ 1 

R + HA(Bi + /3/3 T )A T H T 



1 

+ 2 Tr 



[R + HAB A T H T ] ' 



N, 



(32) 



where f3 = f3% — /3q. 

The approximated optimal power allocation is the solution 
to the following optimization problem: 



max J(p g (y\H 1 ),p g (y\H )), 



(33) 



s.t. Tr 



AA 



< Pu 



< A X ^Pmax. 

For the objective function given in J32l . the optimization 
is over the amplitude matrix A, or equivalently the power 
allocation among the sensors. The optimization problem can 
be solved by various constrained optimization techniques, and 
in the simulations we use the interior point method [2] [4]. 

IV. Special case with orthogonal channels 

A special case of the distributed detection system depicted 
in Figure Q] is that in which all of the sensors have orthogonal 
channels for communication with the FC. A system diagram 
for this case is shown in Figure [2] 

Compared to the system in Figure Q] this special case has 



H = diag{75l 

R = ( 7 2 I X , 



9k}, 



(34) 
(35) 



Sensor 1 




Channel 1 






y l = a l ,[g l u l + n l 



Sensor K 


"k 


Channel K 


M K — 7k i*K ) 







Fusion 
Center 

»o = r (y) 



Fig. 2. Distributed detection system with orthogonal channels. 




Fig. 3. Illustration of region <S. 



where Ik is a K-by-K identity matrix, and the noises in all the 
channels are independent and have the same variance a 2 . Here, 
<7j is the channel power gain for sensor j. By substituting the 
above two matrices into the optimization problem in d32l-(l33li. 
after some algebra, the power allocation for this special case 
reduces to the solution to the following optimization problem, 



max J(Pi,--- ,P K ) 

{Pu-,Pk} 



K 

E 

3=1 
K 



g 2 + aF(j)9jPj 
<J 2 + Mj)9jPj 



ap{j)9jP 3 



(36) 



s.t. ^P,-<Ptc 

3 = 1 

< Pj < P„ 



3=1, 



,K, 



where 



<*fU) = PfU)(1 - Pd(j)) + Pd(j)(Pd(j) - PfU)), 



a D (j) = P D (j)(l - P F (j)) - PfUKPdU) ~ PfU)), 



PfV) = Pd(J)(1 - PdV)), 



and 



M) = PfU)^-Pf(3))- 



(37) 

), 
(38) 

(39) 
(40) 



Note that Pj = a 2 is the power allocated to sensor j for 
transmitting its findings to the FC. The objective function 
is fully decoupled, a direct result of the orthogonal channels 
between the sensors and the FC. 
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The first order partial derivative of J (Pi, 
respect to Pj is given by 

d 



, Pk ) with and 

c 3 = [Mi) 2 M) - /Mi)] 



dP. 



J{Pi,--- ,Pk) 



(Mi) ~ M))<y 2 gj (Mi) - t3p{j))(J 2 9j 



(41) 



It has a interesting property as stated by the following lemma 



Lemma 2 The first order derivative of the objective function 
J (Pi, - • • , Pk) with respect to Pj is always nonnegative at 
any valid power allocation point Pi > 0. That is 



> 0. 



Pi>0 



(42) 



Proof: See Appendix U ■ 
Lemma [2] tells us that the objective function in (|36] l is 
nondecreasing with increasing power budget Ptot- Since we 
are maximizing a nondecreasing function, the optimal point 
is always at the constraint boundary, i.e., Ylf=i = ^tot, 
or Pj = P m ax(j), j = I,--- ,K. This result is intuitively 
plausible since it makes full use of the power budget. 

Practical sensors should always have Pq > Pp, since, if 
Pd = Pp, the sensors do not provide useful information. With 
this condition, we can easily prove the following corollary. 

Corollary 2 If Pd{]) > Pf(j'), f/zen the first order derivative 
of the objective function J(Pl, • • • , Pr-) wif/t respect to Pj is 
always strictly positive at any valid power allocation point 
P > 0. 

Corollary [2] tells us that there is no stationary point inside the 
constraint boundary, so gradient based optimization techniques 
will not get stuck. 

The second order partial derivative of J (Pi, • • • , Pk) with 
respect to Pj is given by 

52 - ,p/ 



dP 2 



J(Pi 



[MMMj)] Mi) 
W 2 +/3FU)9jPjf 

[MM Mj)] Mj) 
W 2 + 0D(j)9jPj} 3 

c cr e + (>'.,/,/', + cy^Pf + r,,^/;;' 

[^ 2 Mf(jMMVMd(jMM 3 



(43) 



where, 

Co = Mi) [Mi) - Mi)] + Mi) [Mi) - Mi) 

Ci = Mi)Mj) [Mi) - Mi) + Mi) - Mi) 

c 2 = Mi)Mj) [Mi) IMj) - Mi)] 

+ Mi) [Mi)- Mi)] 1, (46) 



(44) 



(45) 



Mjf IMj) " M.?)] 



(47) 



The second order partial derivative in d43l is not always non- 
positive, which means the objective function J (Pi, • • • , Pk) 
is not always concave. However, the following lemma specifies 
the region in terms of local sensor observation quality, where 
the second order derivative of the objective function is indeed 
nonpositive. We again assume that practical sensors have 
Pd > Pf- 

Lemma 3 The second order partial derivative of the objective 
function, ■^p^J{P\, • • • , Pk) < 0, for any allocated power 

Pj > 0, if and only if (Pd(j), Pf(j)) € S, where S is defined 
by 



{S(P D ,P F ) 



4 _ 2 Pf ~ 4 



1 + 12P P 



UP 2 



3 1 1 

^4-2^ + 4 



1 + 12P F -12P2 



< P F < Pd < 1 



(48) 



Proof: See Appendix ITU 
Region S is depicted in Figure [3] in which r\ 



hPp 



- ^y/l + 12P f -12P F and r 2 = 



- hPp 



\y/T+ 12P F - 12 Pp. We will show that, if all the sensors 
operate in region S, the power allocation can be solved by a 
weighted waterfilling algorithm. 

To derive the algorithm, we will use the technique of 
Lagrange multipliers [2] [4]. The Lagrangian associated with 
the constrained optimization problem in (l36l l is 



L(Pu- 



,Pk,X) 
' x «f(j)P 3 




a 2 + a D (j)Pj 



M)Pj a 2 +(3 D (j)P, 



Vj(Pj ~ PnaxO')), 




(49) 



where A, {vj}f—x, an d {ty/}J=i are Lagrange multipliers. The 
Karush-Kuhn-Tucker necessary conditions for optimality [4] 
are 



.Pk) 



X* + v* - 77* = 0, (50) 



K 



a* > o, if ^p* = Pot, 

K 

a* = o, if ]Tp;< p ot , 

v* > 0, if P- = 0, 

v* = o, if p; > o, 



(51) 

(52) 

(53) 
(54) 
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and 



>0, 
= 0, 



if P* = Pn 

if P* < P n 



(55) 
(56) 



All variables with superscript "*" are at their optimal values. 
Since the optimal solution is always on the total power 
constraint boundary as indicated by Lemma |2j ( f52b is inappli- 
cable except in the trivial case when Ylj=i Pmax(j) < -Ptot 
and all the sensors just transmit at full power. So, in other 
words, we consider that the total power constraint is always 
active (meaning Ylj=i Pj — Ptot) and A* is always positive. 
Similarly, v* (or rj*) is positive only when the constraint 
P* > (or P* < P max (j)) is active. 

< [50b is the key equation to solve. For each fixed value of 
A*, we can solve ( f50b to obtain the corresponding P*, v*, and 
r\*- We can then calculate the corresponding ^ 
goal here is to find a A* such that Ylj=i Pj 

Substituting (RTt into i50\ . we have 



Ptot- 



P*. The 



(a F (J) ~ Pf {j)Wg 3 , (a D (j) ~(3 D (j))a 2 gj 
(^ 2 +/3fO-)5,P,) 2 



Let us define 

d 



<9P, 



w h0 



J (Pi 



-A* 



K 



(<7 2 + Mj)s,p,) 2 

h^-r?*=0. 



(57) 



f;=o 



/3 D (j))/a 2 , (58) 



and 



ap 



J(Pi 



p, 



A' 



J 

2, 



(«F (j) -PF(j))<7 2 9j 

(v 2 + P F (j)gjP max U)) 2 

(<*dU) -ppU)W 2 g 3 
(° 2 + 0D{j)g j P m Uj)f 



(59) 



is inactive {v* 



0). A* is monotonically decreasing 



with increasing P*, as long as Wj.i < A* < Wj$. This can 







For sensor j operating at (Pu(j), Pf(j)) G 5, we can see that, 
when A* > w Jt o, we have P* = 0, i]* = 0, and v* = \*—Wj, . 
Sensor j starts to get positive power allocation P* > when 
A* < Wj.o, and in this case the constraint < P* < P m ax(j) (5) 



be easily verified using Lemma [3] since -^p? J(Pi, • • • , Pk) 
is always non-positive for (Pp,(j), Pp(j)) £ S. When A* < 
Wj t i, PJ is now "clamped" at P max (j), so we have v. 
and rjj = wj^i — A* > 0. 

This case is depicted as the dashed line in Figure [4] 
Regardless of the value of A*, we can easily verify that there is 
always a one-to-one mapping (through (l50t or d57b ) between 
P* and A*, and P* is nondecreasing with decreasing A*. 

We have the following observations based on the above 
analysis. (1) If all the sensors operate at (Pu(j), Pf{])) £ 
S, j — 1, ••• ,K, there is a one-to-one mapping between 
J2f=i Pj an d an d SjLi Pj i s nondecreasing with de- 
creasing A. (2) The sensors get positive power allocation with 
increasing power budget (hence decreasing A*) in a sequential 
fashion, and it is determined by u^o- 

The above observations with a two-sensor case are illus- 
trated in Figure [5] Based on the observations, the solution 
can be found through a "weighted waterfilling" procedure, 
specified by the following algorithm. Initially the sensors send 
their local detection quality {Po(i), Pf(J))}^=i to tne FC > 
and then the algorithm is executed at the FC. 
Algorithm 1: 

(1) The FC estimates the channel power gain {gj}j—i of the 
sensors and the noise variance a 2 . 

(2) The FC calculates {w^oljLi using d581 > and ranks them 
such that Wjfi > w J+ i,o. The FC also solves for 
{wj,i}^ =1 using i5% . Then the FC calculates the power 
allocations {Pj^ w . with A* = Wj^ for each j using 

<Ez). 

(3) The FC finds the largest j' such that 
Ptot, and assigns w a = Wj',o and Wb = Wj> 
K, the FC sets Wb = 0. The rest of the algorithm conducts 
a simple line search on A* between w a and Wb such that 

P L\* = Ptot- ^ 

> e, where e is a small 
Otherwise, 

the FC stops the algorithm and broadcasts the desired 
power allocations {P^ w to tne sensors. 

The FC sets w c — (w a + Wb) / % following bisection rule. 
IfY^k=i Pk w — Ptot, the FC sets w a = w c . Otherwise, 
the FC sets Wb = w c . The algorithm goes to (4). 



-iPk 



< 

S',o — 
1,0- Ifj' = 



El 



(4) // Ef=i Pk, Wl 

positive number, the algorithm goes to (5). 



k—l k,w a 



Algorithm 1 can be easily seen to converge because of the 
mono tonicity between total power budget P tot and A*. And the 
simple line search of A* between w a and Wb should converge 
very quickly [2]. 

If sensor j operates at (Pd (j) , Pf (j)) S, A* is mono- 
tonically increasing with P* when P* is small and is mono- 
tonically decreasing with P* when P* grows larger. This is 
because Co is negative and C\, C2, and C3 are nonnegative 
in (143 1 , Thus -gpxJ(P\,'-- , Pk)\ P *_ is positive, and will 
eventually become negative with P* sufficiently large. There- 
fore, J(Pi, • • • , Pk) nas a single local maximum at some 
P* > 0, and it is possible that (|57T > has two solutions for P* 
for a single A*. This case is also shown in Figure H] 

If one or more sensors operate at (Pd(j'), Pp(j)) & S, 
the monotonicity and one-to-one mapping between A* and 
Y^,j=i Pj may be invalid, as shown in Figure [6] Therefore, 
the computationally efficient Algorithm 1 does not work for 
this case. The solution can still be obtained from general 
constrained optimization techniques, such as the interior point 
method [2] [4]. 

V. Simulations 

In this section, numerical results are provided to illustrate 
the power allocation scheme developed in this paper. In the 
simulations, we consider the following settings. There are K 
sensors scattered around an FC and the distances from the 
sensors to the FC are {dk]^ =1 - The pathloss of signal power 
at the FC from sensor k follows the Motley-Keenan pathloss 
model (expressed in dB) without the wall and floor attenuation 
factor [23]: 

PL k = PL + 10nlog 10 (4/do) (60) 

where PLq is a constant set to 55 dB, and do is also a constant 
set to 1 m in the simulations. Here, n is the pathloss exponent, 
which is set to 2 for free space propagation. The channel 
power gain for sensor k is — —PLk in dB. The noise 
variance at the FC is a 2 = — 70 dBm, and we assume the noise 
covariance matrix is R = a 2 lk- The maximum transmitting 
power of each sensor is P max = 2 mW (3 dBm). The 
total power budget in the simulations will be below or equal 
to 2K mW, otherwise each sensor will just use maximum 
transmitting power (a trivial case). All the sensors perform 
Neyman-Pearson detection with false alarm probabilities set 
to Pp(k) = 0.04, k — 1, • • ■ , K. The detection probabilities 
may vary according to their local observation qualities. The FC 
also uses a Neyman-Pearson detector targeting the same false 
alarm probability as the local sensors 2 , i.e., Pf,fc = 0.04. 

We will investigate three scenarios: 1) two sensors with or- 
thogonal MIMO channels. 2) two sensors with non-orthogonal 
MIMO channels. 3) ten sensors with orthogonal MIMO chan- 
nels. 

2 The operation points in terms of targeted false alarm probabilities of the 
detectors at the sensors and the FC can be designed to be different from 
one another, and the analysis in this paper does not require the false alarm 
probabilities to be the same. The optimal design of the operating points of 
the detectors is beyond the scope of this paper, and we use the same false 
alarm probability for the sake of simplicity. 



A. Two Sensors with orthogonal channels 

Two sensors are located d\ — 2 m and d.2 = 5 m away from 
the FC, and communicate with the FC through orthogonal 
channels. Channel gains gi and g^ are calculated by d60l l. and 
are -61 dB and -69 dB respectively. 

We will consider four cases with various local detection 
quality combinations. 

CaselV-Al: P D {1) = 0.1, P D {2) = 0.9. 

Case|\3£: Pd(1) = 0.7, P D {2) = 0.9. 

CaseEAfr Pd(1) = 0.9, P D {2) = 0.9. 

Case|V3: Pd(1) = 0.9, P D {2) = 0.7. 
In Case IV-A1 one sensor does not operate in region S, so 
the interior point optimization algorithm is used in this case. 
In Case IV-AE . Case IV-AB . and Case IV-AB -. both sensors 
operate in region S, thus Algorithm 1 is used. The total power 
budget P to t varies from -14 dBm to 6 dBm (when each sensor 
transmits at full power 2 mW). 

In addition to the proposed power allocation, we also 
include an equal power allocation and an equal received SNR 
power allocation for comparison. The equal power allocation 
simply distributes power equally among the sensors, without 
considering channel or local detection quality. The equal 
received SNR power allocation considers channel quality only 
and distributes power among sensors in such a way that the 
received signals from the sensors have the same SNR. 

Figure [7] shows the proposed power allocation as well as 
the equal power allocation and the equal received SNR power 
allocation, we can see that for Case IV-AH the proposed power 
allocation distributes all the power to sensor 2, until the 
maximum output power is reached for sensor 2, and then 
sensor 1 starts to get positive power allocation. This is because, 
although sensor 1 is closer to the FC (hence it has a better 
channel), its detection quality is much worse than that of 
sensor 2. For Case IV-AE . the detection quality of sensor 1 is 
still worse than that of sensor 2, but the gap is small enough for 
sensor l's better communication channel to show difference. 
The proposed allocation distributes all the power to sensor 1, 
until the maximum output power is reached for sensor 1, and 
then sensor 2 starts to get power allocation. For Case lV-Ar 3 and 
Case IV-AB -. sensor 1 has a better communication channel and 
equal or better local detection quality, so it is not surprising 
to see that the proposed allocation distributes power to sensor 
1 as much as possible, and these two cases have the same 
power allocation as Case IV-AE . The waterfilling effect of the 
proposed power allocation is obvious in this scenario. The 
equal power allocation and the equal received SNR allocation 
do not change between the four cases because they are not 
affected by the local detection quality. 

Recall that the proposed power allocation scheme is based 
on the J-divergence instead of detection probability and false 
alarm probability. Furthermore, the J-divergence between two 
Gaussian mixture distributions, i.e., that of the received signals 
at the FC under the two hypotheses, is approximated in 
this optimization by the J-divergence between two Gaussian 
distributions, with the same means and covariance matrices 
as the Gaussian mixtures. We next show the quality of this 
approximation. 
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-proposed allocation for case V-Al 
—A— proposed allocation for case V-A2 
— T— proposed allocation for case V-A3 
□ proposed allocation for case V-A4 

equal power allocation 
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Fig. 7. Equal power allocation, equal received SNR allocation, and the 
proposed power allocation for the four cases in Section IV-AI (Note that the 
curves for A, V and □ overlay in this graph.) 



-simulated allocation for case V-Al 
simulated allocation for case V-A2 
-T— simulated allocation for case V-A3 
-b— simulated allocation for case V-A4 

equal power allocation 

- - - equal RX SNR allocation 




Fig. 8. Equal power allocation, equal received SNR allocation, and the 
simulated optimal power allocation for the four cases in Section IV-AI (Note 
that the curves for A, V and □ overlay in this graph.) 



Figure [8] shows the optimal power allocation found by 
simulations. The FC uses a Neyman-Pearson detector based 
on the likelihood ratio of the received signal y. The optimal 
power allocation is the one that produces the highest Pd,fc 
for a given total power budget. The optimal power allocation is 
found by a brute-force grid search in a two dimensional space 
of all possible power allocations. For each possible power 
allocation point, 2 x 10 Monte Carlo runs are used to provide 
the corresponding Pd,fc- 

We can see that the simulated optimal power allocations in 
Figure [8] perfectly match the proposed power allocations in 
Figure [7] 

The contours of the approximated J-divergence (used as the 
objective function to develop the proposed power allocation) 
and the simulated Pd,fc for Case IV- AB are plotted in Figure 
[9] and Figure [10] The contours for the other cases are similar 
and are omitted due to limited space. We can see from Figure 
[9] and Figure [10] that the two contours match each other well 
at any power allocation point in this scenario, including the 
case in which either sensor transmits at full power (2 mW or 
equivalently 3 dBm). When sensor 1 transmits at full power 
(3 dBm), the corresponding received SNR is about 12 dB. 



So we can see the approximation works well in this scenario 
even with received SNR as high as 12 dB. This is the reason 
for the perfect match between the proposed allocation and the 
simulated optimal allocation. 

In Figure fTTT- Figure FBI we plot the detection probability 
at the FC Pd.fc as a function of the total power budget 
-Ptot for the four cases, and for the proposed analytical and 
simulated optimal allocation. The performance of the proposed 
power allocation matches that of the simulated optimal power 
allocation very well in all four cases (the two curves overlay 
in the four figures), and it can save almost 3 dB in P tot 
compared to the equal power allocation to achieve the same 
Pd,fc- The equal received SNR power allocation considers 
only the channel quality, so it performs even worse than the 
equal power allocation in Case IV- Al l . where optimally sensor 1 
with a better channel but much worse detection quality should 
use less power than sensor 2. On the other hand, in Case 
IV-AE through Case IV- Al l, the equal received SNR allocation's 
performance is between the proposed allocation and equal 
allocation. 



B. Two sensors with non-orthogonal MIMO channels 

The setting here is similar to the setting in Section IV-AI 
but the data transmission is over non-orthogonal channels. The 
channel matrix is given by 



H 



"1 


p 




.91 


0" 


p 


1 







•92. 



(61) 



<7i and gi are the same as those in Section IV-AI p = 0.2 
is the interference coefficient. We consider four cases, Case 
IV-B1 through Case lV-Bt l, with exactly the same local detection 
quality combinations as those of Case IV-AI through Case 
IV-Aft . The interior point optimization algorithm is used to 
solve the proposed power allocation for all four cases. 

Figure [15] shows the proposed power allocation as well 
as the equal power allocation and the equal received SNR 
allocation. The major difference between Figure[l5]and Figure 
in Section IV-AI is that the waterfilling effect in Case IV-BE 
through Case IV-Bti - is not as obvious as that in Case IV-AE 
through Case IV-Att . The non-orthogonal channel makes the 
contribution from the sensors at the FC dependent. So the 
power allocation is less extreme in the sense that the "better" 
sensors take all the power. 

Figure [16] shows the optimal power allocation found by 
simulations for the four cases. For Case I V-Bl and Case I V-Bl l. 
the simulated optimal power allocations in Figure [16] match 
the the proposed power allocations in Figure [15] For Case 
IV-BE and Case IV-BB . the simulated optimal power allocations 
are different from the proposed power allocation in higher total 
power budget region. For these two cases, Pd,fc as a function 
of power allocation is quite "flat" in the high total power 
region. This can be seen from the wider gaps between contour 
lines in Figure [18] and from the results in Figure [20] and Figure 
|2T1 that the performance of the proposed power allocation is 
still very close to that of the simulated power allocation. So the 
artifact of Monte Carlo trials is more pronounced, and explains 
the difference between the proposed power allocations and the 
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Fig. 9. The contour of the appr oxima ted J- Fig. 10. The contour of the simulated P ,fc for Fi g- 1L The FC detection prob ability P D , FC as 
divergence objective function for Case IV- AB . Case lV-AB . a function of Ptot of Case lV-AL 
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Fig. 12. The FC detection probability Pd,fc as Fig- 13. The FC detection probability Pd,fc as Fig- 14- The FC detection probability Pd,fc as 
a function of Ptot of Case I V-Afc . a function of Ptot of Case |V-Afe . a function of Ptot of Case IV- Al l. 




- simulated allocation for case V-B 1 

- simulated allocation for case V-B2 

- simulated allocation for case V-B3 

- simulated allocation for case V-B4 
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Fig. 15. Equal power allocation, equal received SNR allocation, and the Fig. 16. Equal power allocation, equal received SNR allocation, and the 
proposed power allocation for the four cases in Section I V-B I simulated optimal power allocation for the four cases in Section |V-B1 



simulated optimal power allocation in the high total power 
region for Case lV-BE and Case lV-BB . 

The contours of the approximated J-divergence and the 
simulated Pd.fc f° r Case IV-BE are plotted in Figure [17] 
and Figure [18] The two contours match each other well, but 
the artifact of Monte Carlo trials in Figure [18] is obvious, as 
discussed above. 

In Figure [191-Figure l22l we plot the detection probability 
at the FC Pd.fc as a function of the total power budget 
P t ot> for the four cases. The performance of the proposed 
power allocation matches that of the simulated optimal power 



allocation very well in all four cases (the two curves overlay 
in the four figures). Compared to Figure ITTT- Figure [T4l in 
Section [V-AI the performance gap between the proposed power 
allocation and the equal power allocation (as well as the equal 
received SNR allocation) is slightly narrower. 

C. Ten sensors with orthogonal MIMO channels 

Here we consider ten sensors scattered around an FC. We 
will investigate five cases, according to various sensor distance 
and detection probability combinations. In Case IV-Cl , Case 
IV-CE . and Case IV-Cti -, some of the sensors do not operate in 
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Fig. 17. The contour of the appr oxima ted J- Fig. 18. The contour of the simulated P ,fc for Fig. 19. The FC detection prob ability P D , FC as 
divergence objective function for Case lV-BE . Case lV-Bt . a function of Ptot of Case lV-BL 
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Fig. 20. The FC detection probability Po FC as Fig- 21. The FC detection probability Pn F C as Fig. 22. The FC detection probability Pd FC as 
a function of Ptot of Case IV-Bt . a function of Ptot of Case lV-Br S. a function of Ptot of Case IV-Bl l. 
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TABLE I 

Percentage of the total power allocated to each sensor for 

FOR CASE lV-Cl l. 
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TABLE II 

Percentage of total power allocated to each sensor for case 



region S, so the interior point optimization algorithm is used 
in these three cases. In Case IV-CB and Case IV-C5 . all the 
sensors operate in region S, thus Algorithm 1 is used. The 
total power budget P tot varies from -7 dBm to 1 3 dBm (when 
each sensor transmits at full power 2 mW). 

1) Case W-d : The distance between sensor j and the FC is 
dj = 2 + 0.6(j — 1) m, e.g. d\ = 2 m and d\Q = 7.4 m. Sensor 
j has detection probability of Pd(j) = 0.1 + 0.09(j — 1), e.g. 
Pd(1) = 0.1 and Pd(10) = 0.91. In this case, sensors closer 
to the FC have worse detection probability. 

The percentage of the total power budget P t ot allocated to 
each sensor is shown in Table J] When P tot is low, more power 
is distributed to sensors farther away from the FC. Intuitively 
this is because even though sensors closer to the FC have good 
channel gain, their local detection qualities are much worse 
than those of the farther sensors. With P tot increases, power 
is distributed more evenly among the sensors, because some 



sensors have already reached their maximum output power. 
Eventually, when the total power budget reaches 13 dBm, 
every sensor transmits at P m ax = 3 dBm. 

Figure [23] shows that the approximated and actual (by 
Monte Carlo simulation) J-divergences are very close even 
when every sensor is transmitting at full power. The maximum 
received SNR of the closest sensor at the FC is about 12 dB. 
Similar to Section IV-AI we can see that the approximation in 
J-divergence works well with SNR as high as 12 dB. 

Figure [24] shows the simulated detection probability at the 
FC Pd,fc °f me proposed power allocation and the equal 
power allocation 3 . In this case, the proposed power allocation 

3 For this scenario with 10 sensors, the complexity is too high to find 
the optimal power allocation by brute-force search and simulations in a ten- 
dimensional space of all possible power allocations. So in this scenario, we 
will not include the optimal power allocation found by simulation that gives 
the highest Pd,fc- 
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TABLE III 

Percentage of total power allocated to each sensor for case 
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TABLE V 

Percentage of total power allocated to each sensor for case 
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TABLE IV 

Percentage of total power allocated to each sensor for case 



6 dB saving) of the total power used by the equal power 
allocation to achieve the same detection performance at the 
FC. 

5) CaseWTX: dj = 4 m, and P D (j) = 0.8. In this case, all 
sensors have the same detection probability and distance from 
the FC. This case serves as a sanity check because intuitively 
the sensors should always have equal power allocation in this 
case. Table M and Figure [3T1 verify this intuition. 



can use about 1 dB less power than equal power allocation to 
reach the same detection performance at the FC. 

2) Case \V-C\ >: The distance between sensor j and the FC 
is dj = 2 + 0.6(j — 1) m. Sensor j has detection probability of 
P D (j) = 0.4 + 0.06(j - 1), e.g. Pd(1) = 0.4 and P D {10) = 
0.94. In this case, sensors closer to the FC still have worse 
detection probability, but the gap in local detection qualities 
is not as large as that in case 1. 

In this case, Table [II] shows that when P t ot is low, more 
power is distributed to sensors closer to the FC. The advantage 
in channel gain of sensors closer to the FC has offset their 
disadvantage in detection quality. The water filling effect is 
obvious here, and the sensors get positive power allocation in 
a sequential fashion. 

In Figure [25] the approximated and actual J-divergences are 
still very close. Figure[26]shows the proposed power allocation 
has a maximum power saving of 4 dB (more than 50%) 
compared to equal power allocation. 

3) Case IFCfr: dj = 2 + 0.6(j - 1) m, and P D (j) = 0.8. 
In Table [HI] power allocation is even more biased toward 

sensors closer to the FC compared to Case IV-CE . All the 
sensors now have the same detection quality, but the sensors 
closer to the FC have the advantage in channel gain. 

In Figure [27] the approximated and actual J-divergences 
start to show some difference. Figure [28] shows the proposed 
power allocation has a maximum power saving of more than 
5 dB. 

4) CaseWT^: dj = 2 + 0.6(j-l) m, and P D (j) = 0.94- 
0.06(j — 1). In this case, the sensors closer to the FC have 
advantage in both channel gain and local detection quality. So 
Table II V I power allocation is even more biased toward sensors 
closer to the FC compared with Case IV-CB . 

In Figure [29] the approximated and actual J-divergences 
have more difference than the previous three cases, but their 
shapes are still quite similar. Figure[30]shows that the proposed 
power allocation consumes only less than 25% (has more than 



VI. Conclusions 

In this paper we have studied the problem of optimal power 
allocation for distributed detection over MIMO channels in 
wireless sensor networks. Our contribution is novel compared 
to the pervious work in the following senses: (1) We have 
considered a distributed detection system with a MIMO chan- 
nel to account for non-ideal communications between the 
sensors and the FC. (2) We have assumed that there are a 
finite number of sensors and the sensors have independent 
but nonidentically distributed observations. (3) We also have 
assumed both individual and joint constraints on the power 
that the sensors can expend to transmit their local decisions to 
the FC. (4) We have developed a power allocation scheme to 
distribute the total power budget among the sensors so that the 
detection performance at the FC is optimized in terms of the 
J-divergence. (5) The proposed power allocation is a tradeoff 
between the quality of the local decisions of the sensors 
and the quality of the communication channels between the 
sensors and the FC. Simulations show that, to achieve the 
same detection performance at the FC, the proposed power 
allocation can use as little as 25% of the total power used by 
equal power allocation. 



Appendix I 
Proof of Lemma[2] 

Proof: Taking the derivative of J{P\, ■ ■ ■ , Pk) in 
with respect to Pj, we have 

d , 
-J(Pi,--- ,Pk) 



dPj N ' ""' {Pi}? =1 , -P,>o 

= (<*f(,7) ~ (3 F (j)W9j (ap(j) - f3p(j))<J 2 gj 

[a* + (3 F {j)gjPjY + (a* + (3 D (j) 9j P 3 y 
= c[(a F (j) - f3 F (j))(a 2 + M)gjPj) 2 

+ (a D (j) - p D (j))(a 2 + p F (j)gjPjf] , 
= c[d a 4 + 2d 1 a 2 g j P j + d 2 g 2 P 2 ] , (62) 
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Fig. 23. Approximated and simulated J-divergence 
as a function of Ptot for the proposed power 
allocation of case I V-J l . 




P {dBm) 



Fig. 24. Simulated Pd,fc as a function of Ptot 
for the proposed power allocation and equal power 
allocation of case I V-Ct . 
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Fig. 25. Approximated and simulated J-divergence 
as a function of P to t for the proposed power 
allocation of case lV-Ct . 
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Fig. 26. Simulated Pb,fc as a function of Ptot 
for the proposed power allocation and equal power 
allocation of case IV-Ct . 
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Fig. 29. Approximated and simulated J-divergence 
as a function of Ptot for the proposed power 
allocation of case IV-Ct t. 



Fig. 27. Approximated and simulated J-divergence 
as a function of Ptot for the proposed power 
allocation of case lV-CT j. 
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Fig. 30. Simulated Pd,fc as a function of Ptot 
for the proposed power allocation and equal power 
allocation of case IV-Cft . 



Fig. 28. Simulated Pb,fc as a function of Ptot 
for the proposed power allocation and equal power 
allocation of case lV-CT ?. 
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Fig. 31. Simulated Pd,fc as a function of Ptot 
for the proposed power allocation and equal power 
allocation of case IV-CE . (Note that the curves for 
o and V overlay in this graph.) 



where 



and 



do = a F (j) + a D (j) - f3 F (j) - f3 D {j), 

da = a F (j)/3 D (j) + a D {j)p F {j) - 2p F (j)/3 D (j), 

u F (3)P D {j) 2 +a D {j)fi F ( 3 f 

-MjfMj)-Mj)/fe0f- 



(63) 

(64) 
(65) 



(66) 



Substituting a F (j), (3 F (j), a D (j), (3o(j) into equations 
(|37| | - (l40l >. and after lengthy algebra, we have 



d 
di 



2(P D (j)~P F (j)) 2 >0, 
(Pn{j) ~ Pf(3)) 2 \PdU)P F U) 



+ (1 - P D (f))(l - P f (j)) 



>0, 



(67) 



(68) 
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and 

d 2 = (Pd<J) - PfU)) 2 [(PdU) + Pf(J)) 

(P d (j) + P F (j) - 1 - PdU)Pf(j)) 2 
+ P D (jfp F (jf(2- P D (j) - P F (j)) 



>0. (69) 



Since c and Pj are also nonnegative, we conclude that 

d 



dP> 



J(Pi,---,Pk) 



> 0. 



(70) 



Appendix II 
Proof of Lemma[3] 

Proof: From (|43| ). we can easily see that 
^nJ{Pi, ■ ■ ■ , Pk) < if and only if d > 0, i = 0, • • • , 3. 
From the proof of Lemma|2] we know that C, > 0, i = 1, 2, 3, 
so J(Pi, ■ • • , Pk) < if and only if C > 0. 

3 

We have 

Co = F (j) [a F (j) - Pf(j)] + D (j) [odU) ~ M) 

(71) 

Substituting a F {j), f3 F (j), a F {j), and (3 F (j) into (O - (|40l 
and after some algebra, we have 

Co= (P (i)-Pi,(j)) 2 [-2P (i) 2 



(3 - 2P F (j))P D (j) + 3P F (i) - 2Pp(j') 2 - 1 



(72) 



Now, Co > if and only if 

-2P I) (i) 2 + (3-2P i? (i))P i3 (j) 

+ 3P F (j)-2Pf(j) 2 -l >0. 



(73) 



Solving the above quadratic inequality leads to Lemma [3] 
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